一组解决方案中的多元化已成为进化计算社区中的热门研究主题。事实证明,它有益于以多种方式优化问题,例如计算一套高质量的解决方案并获得不完美建模的鲁棒性。在文献中,我们首次适应了现实世界中的组合问题的进化多样性优化,即患者的入学计划。我们引入了一种进化算法,以在每种溶液质量的一组解决方案中实现结构多样性。我们还引入了一个突变操作员,偏向于多样性最大化。最后,我们通过模拟证明了多样性对上述问题的重要性。
translated by 谷歌翻译
在处理机器人技术,游戏和组合优化等领域的问题时,质量多样性(QD)算法已被证明非常成功。它们的目的是最大程度地提高基本问题所谓行为空间不同区域的解决方案的质量。在本文中,我们应用QD范式来模拟背包问题上的动态编程行为,并提供对QD算法的第一个运行时分析。我们证明他们能够在预期的伪多项式时间内计算最佳解决方案,并揭示导致完全多项式随机近似方案(FPRAS)的参数设置。我们的实验研究根据在行为空间中构建的解决方案以及获得最佳解决方案所需的运行时评估了经典基准集的不同方法。
translated by 谷歌翻译
最近,已经开发了不同的进化计算方法,该方法为给定优化问题生成了一组高质量的解决方案。许多研究都认为多样性1)是探索行为空间(质量多样性)或2)以增加解决方案的结构差异(进化多样性优化)的平均值。在这项研究中,我们引入了一种共同进化算法,以同时探索多组分旅行小偷问题的两个空间。结果表明,与文献的基线进化多样性算法相比,共同进化算法具有明显更高多样性的能力。
translated by 谷歌翻译
在真实世界优化中,常见的是面对几个次级问题,互动和形成主要问题。子问题之间存在依赖性,使得不可能通过专注于一个组件来解决这样的问题。旅行小偷问题〜(TTP)属于此类别,由旅行销售人员问题〜(TSP)和背包问题〜(KP)形成。在本文中,我们通过优质多样性〜(QD)方法研究了TSP和KP的依赖性。 QD算法提供强大的工具,不仅可以获得高质量解决方案,还提供了在行为空间中的高性能解决方案的分布。我们使用众所周知的TSP和KP搜索操作员介绍基于Map-Elite的进化算法,将TSP和KP得分作为行为描述符。之后,我们进行全面的实验研究,表明使用应用于TTP的QD方法的有用性。首先,我们提供有关TSP / KP行为空间中高质量TTP解决方案的见解。之后,我们表明,通过使用我们的QD方法可以获得更好的TTP解决方案,并显示它可以改善用于在文献中基准测试的广泛TTP实例的最佳已知解决方案。
translated by 谷歌翻译
Transformer variants dominate the state-of-the-art in different natural language processing tasks such as translation, reading comprehension and summarization. Our paper is more directed to use general memory slots added to the inputs and studying the results of adding these slots. This paper is a go on study of general memory slots rule that were added to the input of the proposed model in previous work. We have two main tasks;1) pretraining task using masked language modeling and b) fine tuning task using HotpotQA . This study aims to verify the ability of the proposed model to handle chunks as if they were one chunk comparing with the base model. As baseline we used T5 transformer. We studied the rule of memory slots augmented to each input chunk and studied the model performance without selector. We found that adding memory to input chunks helped the proposed model to overcome the baseline on Masked language modeling task with specific training parameters. Ablation study reveals the ability of using the compressed input chunks with a degradation in performance.
translated by 谷歌翻译
Modern deep neural networks tend to be evaluated on static test sets. One shortcoming of this is the fact that these deep neural networks cannot be easily evaluated for robustness issues with respect to specific scene variations. For example, it is hard to study the robustness of these networks to variations of object scale, object pose, scene lighting and 3D occlusions. The main reason is that collecting real datasets with fine-grained naturalistic variations of sufficient scale can be extremely time-consuming and expensive. In this work, we present Counterfactual Simulation Testing, a counterfactual framework that allows us to study the robustness of neural networks with respect to some of these naturalistic variations by building realistic synthetic scenes that allow us to ask counterfactual questions to the models, ultimately providing answers to questions such as "Would your classification still be correct if the object were viewed from the top?" or "Would your classification still be correct if the object were partially occluded by another object?". Our method allows for a fair comparison of the robustness of recently released, state-of-the-art Convolutional Neural Networks and Vision Transformers, with respect to these naturalistic variations. We find evidence that ConvNext is more robust to pose and scale variations than Swin, that ConvNext generalizes better to our simulated domain and that Swin handles partial occlusion better than ConvNext. We also find that robustness for all networks improves with network scale and with data scale and variety. We release the Naturalistic Variation Object Dataset (NVD), a large simulated dataset of 272k images of everyday objects with naturalistic variations such as object pose, scale, viewpoint, lighting and occlusions. Project page: https://counterfactualsimulation.github.io
translated by 谷歌翻译
Continual Learning is a step towards lifelong intelligence where models continuously learn from recently collected data without forgetting previous knowledge. Existing continual learning approaches mostly focus on image classification in the class-incremental setup with clear task boundaries and unlimited computational budget. This work explores Online Domain-Incremental Continual Segmentation~(ODICS), a real-world problem that arises in many applications, \eg, autonomous driving. In ODICS, the model is continually presented with batches of densely labeled images from different domains; computation is limited and no information about the task boundaries is available. In autonomous driving, this may correspond to the realistic scenario of training a segmentation model over time on a sequence of cities. We analyze several existing continual learning methods and show that they do not perform well in this setting despite working well in class-incremental segmentation. We propose SimCS, a parameter-free method complementary to existing ones that leverages simulated data as a continual learning regularizer. Extensive experiments show consistent improvements over different types of continual learning methods that use regularizers and even replay.
translated by 谷歌翻译
在大型数据集上,对视力任务的深度学习模型进行了培训,因为存在一个通用表示,可用于对所有样本进行预测。尽管事实证明,高复杂性模型能够学习此类表示,但对数据的特定子集进行了培训的专家,可以更有效地推断出标签。然而,使用专家的混合物会提出两个新问题,即(i)在提出新的看不见的样本时分配正确的专家。 (ii)找到培训数据的最佳分区,以使专家最依赖于共同特征。在动态路由(DR)中,提出了一个新颖的体系结构,其中每层由一组专家组成,但是在没有解决这两个挑战的情况下,我们证明该模型可以恢复使用相同的专家子集。在我们的方法中,对多元化的动态路由(DIVDR)进行了明确培训,以解决找到数据相关分区并以无监督的方法分配正确的专家的挑战。我们对MS-Coco的城市景观和对象检测以及实例分割进行了几项实验,显示了几个基线的性能的改善。
translated by 谷歌翻译
我们提出了一个新的灵敏度分析模型,该模型结合了Copulas和在未观察到的混杂状态下的因果推断的标准化。我们将新模型称为$ \ rho $ -gnf($ \ rho $ - graphical正常化流),其中$ \ rho {\ in} [ - 1,+1] $是一个有界灵敏度参数,表示后门非 - 由于未观察到的混杂而引起的因果关系,使用研究最丰富且广泛流行的高斯副群建模。具体而言,$ \ rho $ -gnf使我们能够估计和分析前门因果效应或平均因果效应(ACE)作为$ \ rho $的函数。我们将其称为$ \ rho_ {curve} $。 $ \ rho_ {curve} $使我们能够指定无王牌所需的混杂力量。我们将其称为$ \ rho_ {value} $。此外,$ \ rho_ {curve} $还使我们能够为$ \ rho $ values的间隔提供ACE的界限。我们说明了$ \ rho $ -gnf的好处,并通过对我们的经验王牌界限的实验比其他流行的王牌范围更狭窄。
translated by 谷歌翻译
作物疾病显着影响农业生产的数量和质量。在精确农业的目标是最大程度地减少甚至避免使用农药的目的,具有深度学习的天气和遥感数据可以在检测作物疾病中发挥关键作用,从而允许对农作物的局部治疗。但是,将天气和图像等异质数据结合在一起仍然是一个热门话题和具有挑战性的任务。变压器体系结构的最新发展显示了从不同领域(例如文本图像)融合数据的可能性。当前的趋势是仅定制一个变压器来创建多模式融合模型。相反,我们提出了一种使用三个变压器实现数据融合的新方法。在本文中,我们首先通过使用ConvlstM模型来插值来解决缺失的卫星图像问题。然后,提出了一种多模式融合体系结构,该体系结构共同学习处理视觉和天气信息。该体系结构是由三个主要组件,一个视觉变压器和两个变压器编码器构建的,可以融合图像和天气方式。所提出的方法的结果有望达到97 \%的总体准确性。
translated by 谷歌翻译